Effective dimensionality

Dimensionality reduction | Intrinsic dimension

Formally, we can think of effective dimensionality of a dataset by considering the Eigenvalues of the Covariance matrix.

With the pseudoprobability defined by eigenvalues,

pk=λkk=1Kλk p_k = \frac{\lambda_k}{\sum_{k'=1}^{K}\lambda_k'}

We can calculate the entropy

H(p)=k=1Kpklog(pk) H(\mathbf{p}) = -\sum_{k=1}^{K}p_k \log(p_k)

then the effective dimensionality dd can be calculated by

d=exp[H(p)] d = \exp \left[ H(\mathbf{p}) \right]

from

k=1d1dlog(1d)=H(p) - \sum_{k=1}^{d} \frac{1}{d} \log \left( \frac{1}{d} \right) = H(\mathbf{p})